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4 <110> APPLICANT: Yanagi, Masayuki 

5 Emerson, Suzanne 

6 Bukh, Jens 

7 Purcell, Robert 

9 <120> TITLE OF INVENTION: CLONED GENOME OF INFECTIOUS HEPATITIS C 
10 VIRUSES OF GENOTYPE 2a AND USES THEREOF 

13 <130> FILE REFERENCE: NIH255.001NP 

15 <140> CURRENT APPLICATION NUMBER: US 09/980, 559A 

16 <141> CURRENT FILING DATE: 2002-05-14 

18 <150> PRIOR APPLICATION NUMBER: PCT/USOO/15446 

19 <151> PRIOR FILING DATE: 2000-06-02 

21 <150> PRIOR APPLICATION NUMBER: US 60/137,693 

22 <151> PRIOR FILING DATE: 1999-06-04 
24 <160> NUMBER OF SEQ ID NOS : 70 

26 <170> SOFTWARE: FastSEQ for Windows Version 4.0 

28 <210> SEQ ID NO: 1 

29 <211> LENGTH: 9711 

30 <212> TYPE: DNA 

31 <213> ORGANISM: Hepatitis C virus 

33 <400> SEQUENCE: 1 

34 acccgcccct aataggggcg acactccgcc atgaatcact cccctgtgag gaactactgt 60 

35 cttcacgcag aaagcgtcta gccatggcgt tagtatgagt gtcgtacagc ctccaggccc 12 0 

36 ccccctcccg ggagagccat agtggtctgc ggaaccggtg agtacaccgg aattgccggg 180 

37 aagactgggt cctttcttgg ataaacccac tctatgcccg gccatttggg cgtgcccccg 240 

38 caagactgct agccgagtag cgttgggttg cgaaaggcct tgtggtactg cctgataggg 300 

39 tgcttgcgag tgccccggga ggtctcgtag accgtgcacc atgagcacaa atcctaaacc 360 

40 tcaaagaaaa accaaaagaa acaccaaccg tcgcccacaa gacgttaagt ttccgggcgg 42 0 

41 cggccagatc gttggcggag tatacttgtt gccgcgcagg ggccccaggt tgggtgtgcg 480 

42 cgcgacaagg aagacttcgg agcggtccca gccacgtgga aggcgccagc ccatccctaa 540 

43 agatcggcgc tccactggca aatcctgggg aaaaccagga tacccctggc ccctatacgg 600 

44 gaatgaggga ctcggctggg caggatggct cctgtccccc cgaggttccc gtccctcttg 660 

45 gggccccaat gacccccggc ataggtcgcg caacgtgggt aaggtcatcg ataccctaac 720 

46 gtgcggcttt gccgacctca tggggtacat ccctgtcgtg ggcgccccgc tcggcggcgt 780 

47 cgccagagct ctcgcgcatg gcgtgagagt cctggaggac ggggttaatt ttgcaacagg 840 

48 gaacttaccc ggttgctcct tttctatctt cttgctggcc ctgctgtcct gcatcaccac 900 

49 cccggtctcc gctgccgaag tgaagaacat cagtaccggc tacatggtga ctaacgactg 960 

50 caccaatgac agcattacct ggcagctcca ggctgctgtc ctccacgtcc ccgggtgcgt 102 0 

51 cccgtgcgag aaagtgggga atgcatctca gtgctggata ccggtctcac cgaatgtggc 1080 

52 cgtgcagcgg cccggcgccc tcacgcaggg cttgcggacg cacatcgaca tggttgtgat 1140 

53 gtccgccacg ctctgctctg ccctctacgt gggggacctc tgcggtgggg tgatgctcgc 1200 

54 agcccaaatg ttcattgtct cgccgcagca ccactggttt gtccaagact gcaattgctc 1260 

55 catctaccct ggtaccatca ctggacaccg catggcatgg gacatgatga tgaactggtc 1320 

56 gcccacggct accatgatct tggcgtacgc gatgcgtgtc cccgaggtca ttatagacat 1380 
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57 cattagcggg gctcattggg gcgtcatgtt cggcttggcc tacttctcta tgcagggagc 1440 

58 gtgggcgaaa gtcgttgtca tccttctgtt ggccgccggg gtggacgcgc gcacccatac 1500 

59 tgttgggggt tctgccgcgc agaccaccgg gcgcctcacc agcttatttg acatgggccc 1560 

60 caggcagaaa atccagctcg ttaacaccaa tggcagctgg cacatcaacc gcaccgccct 162 0 

61 gaactgcaat gactccttgc acaccggctt tatcgcgtct ctgttctaca cccacagctt 1680 

62 caactcgtca ggatgtcccg aacgcatgtc cgcctgccgc agtatcgagg ccttccgggt 1740 
^3 gggatggggc gccttgcaat atgaggataa tgtcaccaat ccagaggata tgagacccta 1800 

64 ttgctggcac tacccaccaa ggcagtgtgg cgtggtctcc gcgaagactg tgtgtggccc 1860 

65 agtgtactgt ttcaccccca gcccagtggt agtgggcacg accgacaggc ttggagcgcc 1920 

66 cacttacacg tggggggaga atgagacaga tgtcttccta ttgaacagca ctcgaccacc 1980 

67 gctggggtca tggttcggct gcacgtggat gaactcttct ggctacacca agacttgcgg 2040 

68 cgcaccaccc tgccgtacta gagctgactt caacgccagc acggacctgt tgtgccccac 2100 

69 ggactgtttt aggaagcatc ctgataccac ttacctcaaa tgcggctctg ggccctggct 2160 

70 cacgccaagg tgcctgatcg actaccccta caggctctgg cattacccct gcacagttaa 2220 

71 ctataccatc ttcaaaataa ggatgtatgt gggaggggtt gagcacaggc tcacggctgc 2280 

72 atgcaatttc actcgtgggg atcgttgcaa cttggaggac agagacagaa gtcaactgtc 2340 

73 tcctttgttg cactccacca cggaatgggc cattttacct tgctcttact cggacctgcc 2400 

74 cgccttgtcg actggtcttc tccacctcca ccaaaacatc gtggacgtac aattcatgta 2460 

75 tggcctatca cctgccctca caaaatacat cgtccgatgg gagtgggtaa tactcttatt 2520 

76 cctgctctta gcggacgcca gggtttgcgc ctgcttatgg atgctcatct tgttgggcca 2580 

77 ggccgaagca gcactagaga agctggtcat cttgcacgct gcgagcgcag ctagctgcaa 2640 

78 tggcttccta tattttgtca tctttttcgt ggctgcttgg tacatcaagg gtcgggtagt 2700 

79 ccccttagct acctattccc tcactggcct gtggtccttt agcctactgc tcctagcatt 2760 

80 gccccaacag gcttatgctt atgacgcatc tgtgcatggc cagataggag cggctctgct 2 82 0 

81 ggtaatgatc actctcttta ctctcacccc cgggtataag acccttctca gccggttttt 2880 

82 gtggtggttg tgctatcttc tgaccctggg ggaagctatg gtccaggagt gggcaccacc 2 940 

83 tatgcaggtg cgcggtggcc gtgatggcat catatgggcc gtcgccatat tctacccagg 3 000 

84 tgtggtgttt gacataacca agtggctctt ggcggtgctt gggcctgctt acctcctaaa 3060 

85 aggtgctttg acgcgcgtgc cgtacttcgt cagggctcac gctctactga ggatgtgcac 3120 

86 catggcaagg catctcgcgg ggggcaggta cgtccagatg gcgctactag cccttggcag 3180 

87 gtggactggc acttacatct atgaccacct cacccctatg tcggattggg ctgctagtgg 3240 

88 cctgcgggac ctggcggtcg ccgttgagcc tatcatcttc agtccgatgg agaagaaagt 33 00 

89 cattgtctgg ggagcggaga cagctgcttg tggggacatt ttacacggac ttcccgtgtc 3360 

90 cgcccgactt ggtcgggagg tcctccttgg cccagctgat ggctatacct ccaaggggtg 3420 

91 gagtcttctc gcccccatca ctgcttacgc ccagcagaca cgtggccttt tgggcaccat 3480 

92 agtggtgagc atgacggggc gcgacaagac agaacaggct ggggaaattc aggtcctgtc 3 540 

93 cacagtcact cagtccttcc tcggaacatc catctcgggg gttttgtgga ctgtctacca 3600 

94 tggagctggc aacaagactc tggccggctc acggggtccg gtcacgcaga tgtactccag 3660 

95 tgctgagggg gacttagtag ggtggcccag cccccctggg actaaatctt tggagccgtg 3720 

96 cacgtgtgga gcggtcgacc tgtacctggt cacgcggaac gctgatgtca tcccggctcg 3780 

97 aagacgcggg gacaaacggg gagcgctact ctccccgaga cctctttcca ccttgaaggg 3 840 

98 gtcctcagga ggcccggtgc tatgccccag gggccacgct gtcggagtct tccgggcagc 3 900 

99 tgtgtgctct cggggcgtgg ctaagtccat agatttcatc cccgttgaga cactcgacat 3 960 

100 cgtcacgcgg tcccccacct ttagtgacaa cagcacacca cctgctgtgc cccagaccta 4020 

101 tcaggtcggg tacttgcatg ccccgactgg cagtggaaag agcaccaaag ttcctgtcgc 4080 

102 atatgctgct caggggtata aagtgctagt gcttaatccc tcagtggctg ccaccctggg 4140 

103 gtttggggcg tacttgtcta aggcacatgg catcaatccc aacattagga ctggagtcag 42 00 

104 gactgtgacg accggggcgc ccatcacgta ctccacatat ggcaaattcc tcgccgatgg 42 60 

105 gggctgtgcg ggcggcgcct acgacatcat catatgtgat gaatgccatg ccgtggactc 4320 
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106 taccaccatc cttggcatcg gaacagtcct tgatcaagca gagacagctg gggtcagact 4380 

107 aactgtgctg gctacagcta cgccccctgg gtcagtgaca accccccacc ccaacataga 4440 

108 ggaggtggcc cttgggcagg agggcgagat ccccttctat gggagggcga ttcccctgtc 4500 

109 ttacatcaag ggaggaagac atctgatctt ctgccattca aagaaaaagt gtgacgagct 4560 

110 cgcggcggcc cttcggggta tgggcttgaa ctcagtggca tactacagag ggttggacgt 4620 

111 ctccgtaata ccaactcagg gagacgtagt ggtcgtcgcc accgacgccc tcatgacagg 4680 

112 gtatactggg gactttgact ccgtgatcga ctgcaacgta gcggtcactc aagttgtaga 4740 

113 cttcagttta gaccccacat tcaccataac cacacagatt gtccctcaag acgctgtctc 4800 

114 acgtagccag cgccggggtc gcacgggtag gggaagactg ggcatttata ggtatgtttc 4860 

115 cactggtgag cgagcctcag gaatgtttga cagtgtagtg ctctgtgagt gctacgacgc 4920 

116 aggggccgca tggtatgagc tcacaccatc ggagaccacc gtcaggctca gggcgtattt 4980 

117 caacacgccc ggtttgcctg tgtgccaaga ccatcttgag ttttgggagg cagttttcac 5040 

118 cggcctcaca cacatagatg cccacttcct ttcccaaaca aagcaatcgg gggaaaattt 5100 

119 cgcatactta acagcctacc aggctacagt gtgcgctagg gccaaagccc cccccccgtc 5160 

120 ctgggacgtc atgtggaagt gtttgactcg actcaagccc acactcgtgg gccccacacc 5220 

121 tctcctgtac cgcttgggct ctgttaccaa cgaggtcacc ctcacacatc ccgtgacgaa 5280 

122 atacatcgcc acctgcatgc aagccgacct tgaggtcatg accagcacat gggtcttggc 5340 

123 agggggagtc ttggcggccg tcgccgcgta ttgcctggcg accgggtgtg tttgcatcat 5400 

124 cggccgcttg cacattaacc agcgagccgt cgttgcgccg gacaaggagg tcctctatga 5460 

125 ggcttttgat gagatggagg aatgtgcctc tagggcggct ctcattgaag aggggcagcg 5520 

126 gatagccgag atgctgaagt ccaagatcca aggcttattg cagcaagctt ccaaacaagc 5580 

127 tcaagacata, caacccactg tgcaggcttc atggcccaag gtagaacaat tctgggccaa 5640 

128 acacatgtgg aacttcatta gcggcatcca atacctcgca ggactatcaa cactgccagg 5700 

129 gaaccctgca gtagcttcca tgatggcgtt cagtgccgcc ctcaccagtc cgctgtcaac 5760 

130 aagcaccact atccttctca acattttggg gggctggcta gcatcccaaa ttgcaccacc 5820 

131 cgcgggggcc actggcttcg ttgtcagtgg cctagtggga gctgccgtag gcagtatagg 5880 

132 cttaggtaag gtgctagtgg acatcctggc agggtatggt gcgggcattt cgggggctct 5940 

133 cgtcgcattc aagatcatgt ctggcgagaa gccctccatg gaggatgtcg tcaacttgct 6000 

134 gcctggaatt ctgtctccgg gtgccttggt agtgggagtc atctgcgcgg ccattctgcg 6060 

135 ccgacacgtg ggaccggggg aaggcgccgt ccaatggatg aatagactca ttgcctttgc 6120 

136 ttccagagga aatcacgtcg cccccaccca ctacgtgacg gagtcggatg cgtcgcagcg 6180 
13 7 tgtgacccaa ctacttggct cccttaccat aaccagcctg ctcagaagac tccacaactg 624 0 
13 8 gattactgag gactgcccca tcccatgcgg cggctcgtgg ctccgcgatg tgtgggactg 6300 

139 ggtttgcacc atcctaacag actttaaaaa ttggctgacc tccaaattat tcccaaagat 6360 

140 gcccggcctc ccctttgtct cctgtcaaaa ggggtacaag ggcgtgtggg ccggcactgg 6420 

141 catcatgacc acacggtgtc cttgcggcgc caatatctct ggcaatgtcc gcttgggctc 6480 

142 catgagaatc acggggccta agacctgcat gaatatctgg caggggacct ttcctatcaa 6540 

143 ttgttacacg gagggccagt gcgtgccgaa acccgcgcca aactttaagg tcgccatctg 6600 

144 gagggtggcg gcctcagagt acgcggaggt gacgcagcac gggtcatacc actacataac 6660 

145 aggactcacc actgataact tgaaagtccc ctgccaacta ccctctcccg agttcttttc 6720 

146 ctgggtggac ggagtgcaga tccataggtt tgcccccaca ccgaagccgt ttttccggga 6780 

147 tgaggtctcg ttctgcgttg ggcttaattc atttgtcgtc gggtcccagc ttccttgcga 6840 

148 ccctgaaccc gacacagacg tattgatgtc catgctaaca gatccatctc atatcacggc 6900 

149 ggagactgca gcgcggcgtt tagcgcgggg gtcaccccca tccgaggcaa gctcctcggc 6960 

150 gagccagcta tcggcaccat cgctgcgagc cacctgcacc acccacggca aagcctatga 7020 

151 tgtggacatg gtggatgcta acctgttcat ggggggcgat gtgactcgga tagagtctgg 7080 

152 gtccaaagtg gtcgttctgg actctctcga cccaatggtc gaagaaagga gcgaccttga 7140 

153 gccttcgata ccatcagaat acatgctccc caagaagagg ttcccaccag ctttaccggc 7200 

154 ctgggcacgg cctgattaca acccaccgct tgtggaatcg tggaaaaggc cagattacca 7260 
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155 accggccact gttgcgggct gtgctctccc tcctcctagg aaaaccccga cgcctccccc 7320 

156 aaggaggcgc cggacagtgg gcctaagtga ggactccata ggagatgccc ttcaacagct 7380 

157 ggccattaag tcctttggcc agcccccccc aagcggcgat tcaggccttt ccacgggggc 7440 

158 gggcgctgcc gattccggca gtcagacgcc tcctgatgag ttggcccttt cggagacagg 7500 

159 ttccatctct tccatgcccc ccctcgaggg ggagcttgga gatccagacc tggagcctga 7560 

160 gcaggtagag ccccaacccc ccccccaggg gggggtggca gctcccggct cggactcggg 7620 

161 gtcctggtct acttgctccg aggaggacga ctccgtcgtg tgctgctcca tgtcatactc 7680 

162 ctggaccggg gctctaataa ctccttgtag tcccgaagag gagaagttac cgattaaccc 7740 

163 cttgagcaac tccctgttgc gatatcacaa caaggtgtac tgtaccacaa caaagagcgc 7800 

164 ctcactaagg gctaaaaagg taacttttga taggatgcaa gtgctcgact cctactacga 7860 

165 ctcagtctta aaggacatta agctagcggc ctccaaggtc accgcaaggc tcctcaccat 7920 

166 ggaggaggct tgccagttaa ccccacccca ttctgcaaga tctaaatatg ggtttggggc 7980 

167 taaggaggtc cgcagcttgt ccgggagggc cgttaaccac atcaagtccg tgtggaagga 8040 

168 cctcctggag gactcagaaa caccaattcc cacaaccatt atggccaaaa atgaggtgtt 8100 

169 ctgcgtggac cccaccaagg ggggcaagaa agcagctcgc cttatcgttt accctgacct 8160 

170 cggcgtcagg gtctgcgaga agatggccct ttatgacatt acacaaaaac ttcctcaggc 8220 

171 ggtgatgggg gcttcttatg gattccagta ttcccccgct cagcgggtag agtttctctt 8280 

172 gaaagcatgg gcggaaaaga aggaccctat gggtttttcg tatgataccc gatgctttga 8340 

173 ctcaaccgtc actgagagag acatcaggac tgaggagtcc atatatcggg cctgctcctt 8400 

174 gcccgaggag gcccacactg ccatacactc gctaactgag agactttacg tgggagggcc 8460 

175 tatgttcaac agcaagggcc aaacctgcgg gtacaggcgt tgccgcgcca gcggggtgct 852 0 

176 caccactagc atggggaaca ccatcacatg ctacgtgaaa gccttagcgg cttgtaaagc 8580 

177 tgcagggata atcgcgccca caatgctggt atgcggcgat gacttggttg tcatctcaga 8640 

178 aagccagggg accgaggagg acgagcggaa cctgagagcc ttcacggagg ctatgaccag 8700 

179 gtattctgcc cctcctggtg acccccccag accggagtat gatctggagc tgataacatc 8760 

180 ttgctcctca aatgtgtctg tggcgctggg cccacaaggc cgccgcagat actacctgac 8820 

181 cagagaccct accactccaa tcgcccgggc tgcctgggaa acagttagac actcccctgt 8880 

182 caattcatgg ctgggaaaca tcatccagta cgccccgacc atatgggctc gcatggtcct 8940 

183 gatgacacac ttcttctcca ttctcatggc tcaagacacg ctggaccaga acctcaactt 9000 

184 tgagatgtac ggagcggtgt actccgtgag tccct'tggac ctcccagcta taattgaaag 9060 

185 gttacatggg cttgacgctt tttctctgca cacatacact ccccacgaac tgacacgggt 9120 

186 ggcttcagcc ctcagaaaac ttggggcgcc acccctcaga gcgtggaaga gccgggcacg 9180 

187 tgcagtcagg gcgtccctca tctcccgtgg ggggagagcg gccgtttgcg gtcgatatct 9240 

188 cttcaattgg gcggtgaaga ccaagctcaa actcactcca ttgccggaag cgcgcctcct 9300 

189 ggatttatcc agctggttca ccgtcggcgc cggcgggggc gacatttatc acagcgtgtc 9360 

190 gcgtgcccga ccccgcttat tgctctttgg cctactccta ctttttgtag gggtaggcct 9420 

191 tttcctactc cccgctcggt agagcggcac acattagcta cactccatag ctaactgtcc 9480 

192 cttttttttt tttttttttt tttttttttt tttttttttt tttttttttt tttttttttt 9540 

193 tttttttttt tttttttttt tttttctttt tttctctttt ccttctttct taccttattt 9600 

194 tactttcttt cctggtggct ccatcttagc cctagtcacg gctagctgtg aaaggtccgt 9660 

195 gagccgcatg actgcagaga gtgccgtaac tggtctctct gcagatcatg t 9711 

196 <210> SEQ ID NO: 2 

197 <211> LENGTH: 3033 

198 <212> TYPE: PRT 

199 <213> ORGANISM: Hepatitis C virus 
201 <400> SEQUENCE: 2 

2 02 Met Ser Thr Asn Pro Lys Pro Gin Arg Lys Thr Lys Arg Asn Thr Asn 

203 15 10 15 

204 Arg Arg Pro Gin Asp Val Lys Phe Pro Gly Gly Gly Gin He Val Gly 
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Pro 
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Val 
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Val 
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